Helping Our Own: Text Massaging for Computational Linguistics as a New Shared Task

Authors

Robert Dale

Adam Kilgarriff

Abstract

In this paper, we propose a new shared task called HOO: Helping Our Own. The aim is to use tools and techniques developed in computational linguistics to help people writing about computational linguistics. We describe a text-to-text generation scenario that poses challenging research questions, and delivers practical outcomes that are useful in the first case to our own community and potentially much more widely. Two specific factors make us optimistic that this task will generate useful outcomes: one is the availability of the ACL Anthology, a large corpus of the target text type; the other is that CL researchers who are non-native speakers of English will be motivated to use prototype systems, providing informed and precise feedback in large quantity. We lay out our plans in detail and invite comment and critique with the aim of improving the nature of the planned exercise.

Similar resources

SciSumm 2017: Employing Word Vectors for Identifying, Classifying and Summarizing Scientific Documents

This paper describes our approach to "Recognizing Reference Spans, Classifying Their Discourse Facets and Summarizing from Reference Text" as an attempt in the shared task on relationship mining and scientific summarization of computational linguistics research papers at SIGIR 2017.

ACL 2016 The 54th Annual Meeting of the Association for Computational Linguistics Proceedings of the SIGNLL Conference on Computational Natural Language Learning: Shared Task

The CoNLL-2016 Shared Task is the second edition of the CoNLL-2015 Shared Task, now on Multilingual Shallow discourse parsing. Similar to the 2015 task, the goal of the shared task is to identify individual discourse relations that are present in natural language text. Given a natural language text, participating teams are asked to locate the discourse connectives (explicit or implicit) and the...

University of Illinois System in HOO Text Correction Shared Task

In this paper, we describe the University of Illinois system that participated in Helping Our Own (HOO), a shared task in text correction. We target several common errors, such as articles, prepositions, word choice, and punctuation errors, and we describe the approaches taken to address each error type. Our system is based on a combination of classifiers, combined with adaptation techniques fo...

Comparative Study of Neural Models for the COSET Shared Task at IberEval 2017

This paper describes our participation in the Classification Of Spanish Election Tweets (COSET) task at IberEval 2017. During the searching process for the best classification system, we developed a comparative study over possible combinations of corpus preprocessing, text representations and classification models. After an initial models exploration, we focus our attention over specific neural...

Producing a Persian Text Tokenizer Corpus Focusing on Its Computational Linguistics Considerations

The main task of the tokenization is to divide the sentences of the text into its constituent units and remove punctuation marks (dots, commas, etc.). Each unit is a continuous lexical or grammatical writing chain that is an independent semantic unit. Tokenization occurs at the word level and the extracted units can be used as input to other components such as stemmer. The requirement to create...

Publication date: 2010
